A 2 2/3 Superstring Approximation Algorithm

نویسندگان

  • Chris Armen
  • Clifford Stein
چکیده

Given a collection of strings S = fs 1 ; : : : ; s n g over an alphabet , a superstring of S is a string containing each s i as a substring; that is, for each i, 1 i n, contains a block of js i j consecutive characters that match s i exactly. The shortest superstring problem is the problem of nding a superstring of minimum length. The shortest superstring problem has applications in both data compression and computational biology. It was shown by Blum et al. 3] to be MAX SNP-hard. The rst O(1)-approximation algorithm also appeared in 3], which returns a superstring no more than 3 times the length of an optimal solution. Prior to the algorithm described in this paper, there were several published results that improved on the approximation ratio; of these, the best was our algorithm ShortString, a 2 3 4 {approximation 1]. We present our new algorithm, G-ShortString, which achieves an approximation ratio of 2 2 3. Our approach builds on the work in 1], in which we identiied classes of strings that have a nested periodic structure, and which must be present in the worst case for our algorithms. We introduced machinery to describe these strings and proved strong structural properties about them. In this paper we extend this study to strings that exhibit a more relaxed form of the same structure, and we use this understanding to obtain our improved result.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallel and Sequential Approximations of Shortest Superstrings

Superstrings have many applications in data compression and genetics. However the decision version of the shortest superstring problem is NP-complete. In this paper we examine the complexity of approximating a shortest superstring. There are two basic measures of the approximations: the compression ratio and the approximation ratio. The well known and practical approximation algorithm is the se...

متن کامل

Approximating the Shortest Superstring Problem Using de Bruijn Graphs

The best known approximation ratio for the shortest superstring problem is 2 11 23 (Mucha, 2012). In this note, we improve this bound for the case when the length of all input strings is equal to r, for r ≤ 7. For example, for strings of length 3 we get a 1 1 3 -approximation. An advantage of the algorithm is that it is extremely simple both to implement and to analyze. Another advantage is tha...

متن کامل

Rotations of Periodic Strings and Short Superstrings

This paper presents two simple approximation algorithms for the shortest superstring problem, with approximation ratios 2 3 ( 2:67) and 2 42 ( 2:596), improving the best previously published 2 4 approximation. The framework of our improved algorithms is similar to that of previous algorithms in the sense that they construct a superstring by computing some optimal cycle covers on the distance gr...

متن کامل

Better Approximation Algorithms for Maximum Asymmetric Traveling Salesman and Shortest Superstring

In the maximum asymmetric traveling salesman problem (Max ATSP) we are given a complete directed graph with nonnegative weights on the edges and we wish to compute a traveling salesman tour of maximum weight. In this paper we give a fast combinatorial 3 4 -approximation algorithm for Max ATSP. It is based on a novel use of half-edges, matchings and a new method of edge coloring. (A half-edge of...

متن کامل

A 223 { Approximation Algorithm for the Shortest

Given a collection of strings S = fs 1 ; : : :; s n g over an alphabet , a superstring of S is a string containing each s i as a substring; that is, for each i, 1 i n, contains a block of js i j consecutive characters that match s i exactly. The shortest superstring problem is the problem of nding a superstring of minimum length. The shortest superstring problem has applications in both data co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Discrete Applied Mathematics

دوره 88  شماره 

صفحات  -

تاریخ انتشار 1998